NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

AdaWM: Adaptive World Model based Planning for Autonomous Driving

Wang, Hang; Ye, Xin; Tao, Feng; Pan, Chenbin; Mallik, Abhirup; Yaman, Burhaneddin; Ren, Liu Ren; Zhang, Junshan (April 2025, https://iclr.cc/)

World model based reinforcement learning (RL) has emerged as a promising approach for autonomous driving, which learns a latent dynamics model and uses it to train a planning policy. To speed up the learning process, the pretrain-finetune paradigm is often used, where online RL is initialized by a pretrained model and a policy learned offline. However, naively performing such initialization in RL may result in dramatic performance degradation during the online interactions in the new task. To tackle this challenge, we first analyze the performance degradation and identify two primary root causes therein: the mismatch of the planning policy and the mismatch of the dynamics model, due to distribution shift. We further analyze the effects of these factors on performance degradation during finetuning, and our findings reveal that the choice of finetuning strategies plays a pivotal role in mitigating these effects. We then introduce AdaWM, an Adaptive World Model based planning method, featuring two key steps: (a) mismatch identification, which quantifies the mismatches and informs the finetuning strategy, and (b) alignment-driven finetuning, which selectively updates either the policy or the model as needed using efficient low-rank updates. Extensive experiments on the challenging CARLA driving tasks demonstrate that AdaWM significantly improves the finetuning process, resulting in more robust and efficient .
more » « less
Free, publicly-accessible full text available April 24, 2026
AdaWM: Adaptive World Model based Planning for Autonomous Driving

Wang, Hang; Ye, Xin; Tao, Feng; Pan, Chenbin; Mallik, Abhirup; Yaman, Burhaneddin; Ren, Liu Ren; Zhang, Junshan (April 2025, https://iclr.cc/)

World model based reinforcement learning (RL) has emerged as a promising approach for autonomous driving, which learns a latent dynamics model and uses it to train a planning policy. To speed up the learning process, the pretrain-finetune paradigm is often used, where online RL is initialized by a pretrained model and a policy learned offline. However, naively performing such initialization in RL may result in dramatic performance degradation during the online interactions in the new task. To tackle this challenge, we first analyze the performance degradation and identify two primary root causes therein: the mismatch of the planning policy and the mismatch of the dynamics model, due to distribution shift. We further analyze the effects of these factors on performance degradation during finetuning, and our findings reveal that the choice of finetuning strategies plays a pivotal role in mitigating these effects. We then introduce AdaWM, an Adaptive World Model based planning method, featuring two key steps: (a) mismatch identification, which quantifies the mismatches and informs the finetuning strategy, and (b) alignment-driven finetuning, which selectively updates either the policy or the model as needed using efficient low-rank updates. Extensive experiments on the challenging CARLA driving tasks demonstrate that AdaWM significantly improves the finetuning process, resulting in more robust and efficient .
more » « less
Free, publicly-accessible full text available April 24, 2026
PT-CapsNet: A Novel Prediction-Tuning Capsule Network Suitable for Deeper Architectures

https://doi.org/10.1109/ICCV48922.2021.01178

Pan, Chenbin; Velipasalar, Senem (October 2021, 2021 IEEE/CVF International Conference on Computer Vision (ICCV))

Full Text Available

Search for: All records